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CORRESPONDING ADDRESS: 



ADDRESSEE: IDEC Pharmaceuticals Corporation f- c-/ ^ t ' 

STREET: 11099 N. Torrey Pines Road, #160 ''h(^_^ CL/tXkL , 

SJ^or^a P/^a^.^a^r^^4- 

opo v>^}s^ ^;^k 



CITY: 

STATE: California 
COUNTRY: USA 
ZIP: 92037 

COMPUTER READABLE FORM: 



MEDIUM TYPE: Diskette, 3.5 inch, 1.44 Mb 

COMPUTER: Macintosh 

OPERATING SYSTEM: MS. DOS 

SOFTWARE: Microsoft Word 5.0 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/97 8 , 891A 
FILING DATE: 13 NOV 1992 
CLASSIFICATION: 424 



(viii) ATTORNEY/AGENT INFORMATION: 



NAME: Burgeon, Richard P. Jr. 
REGISTRATION NUMBER: 34,787 
REFERENCE/DOCKET NUMBER: 

TELECOMMUNICATION INFORMATION: 

TELEPHONE: (619) 458-0600 
TELEFAX: (619) 546-9274 

INFORMATION FOR SEQ ID NO . : 1 : 

SEQUENCE CHARACTERISTICS: 

LENGTH: 8540 bases 
TYPE; nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: circular 
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53 

54 (ii) MOLECULE TYPE: DNA (genomic) 
55 

56 (iii) HYPOTHETICAL: no 
57 

58 (iv) ANTI- SENSE: no 
59 

60 (ix) SEQUENCE DESCRIPTION: SEQ ID NO.: 1: 

61 

62 GACGTCGCGG CCGCTCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG 60 
63 

64 AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA TGCATGGGGC 120 
65 

66 GGAGAAT6GG CGGAACTGGG C6GAGTTAGG GGCGGGATGG GCGGAGTTAG GGGCGGGACT 180 
67 

68 ATGGTTGCTG ACTAATTGAG ATGCATGCTT TGCATACTTC TGCCTGCTGG GGAGCCTGGG 240 
69 

70 GACTTTCCAC ACCTGGTTGC TGACTAATTG AGATGCATGC TTTGCATACT TCTGCCTGCT 300 
71 

72 GGGGAGCCTG GGGACTTTCC ACACCCTAAC TGACACACAT TCCACAGAAT TAATTCCCCT 3 60 

73 

74 AGTTATTAAT AGTAATCAAT TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC 420 
75 

7 6 GTTACATAAC TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG 480 
77 

78 ACGTCAATAA TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA 540 
79 

80 TGGGTGGACT ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 600 
81 

82 AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC 660 
83 

84 ATGACCTTAT GGGACTTTCC TACTTGGCA6 TACATCTACG TATTAGTCAT CGCTATTACC 720 
85 

86 ATGGTGATGC GGTTTTGGCA GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA 780 
87 

88 TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTG6CACCA AAATCAACGG 840 
89 

90 GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA 900 
91 

92 CGGTGGGAGG TCTATATAAG CAGAGCTGGG TACGTGAACC GTCAGATCGC CTGGAGACGC 960 
93 

94 CATCACAGAT CTCTCACCAT GAGGGTCCCC GCTCAGCTCC TGGGGCTCCT GCTGCTCTGG 1020 
95 

96 CTCCCAGGTG CACGATGTGA TGGTACCAAG GTGGAAATCA AACGTACGGT GGCTGCACCA 1080 
97 

98 TCTGTCTTCA TCTTCCCGCC ATCTGATGAG CAGTTGAAAT CTGGAACTGC CTCTGTTGTG 1140 
99 

100 TGCCTGCTGA ATAACTTCTA TCCCAGAGAG GCCAAAGTAC AGTGGAAGGT GGATAACGCC 1200 
101 

102 CTCCAATCGG GTAACTCCCA GGAGAGTGTC ACAGAGCAGG ACAGCAAGGA CAGCACCTAC 1260 
103 

104 AGCCTCAGCA GCACCCTGAC GCTGAGCAAA GCAGACTACG AGAAACACAA AGTCTACGCC 1320 



Page : 3 



Raw Sequence Listing 
Patent Application US/07/978, 891A 



04/08/93 
12:14:18 
S4313 .raw 



105 

106 TGCGAAGTCA CCCATCAG66 CCTGAGCTC6 CCCGTCACAA AGA6CTTCAA CA6G6GAGAG 
107 

108 TGTTGAATTC AGATCCGTTA ACGGTTACCA ACTACCTAGA CTGGATTCGT GACAACATGC 
109 

110 GGCCGTGATA TCTACGTATG ATCAGCCTCG ACTGT6CCTT CTAGTTGCCA GCCATCTGTT 
111 

112 GTTTGCCCCT CCCCCGTGCC TTCCTTGACC CTGGAAGGTG CCACTCCCAC TGTCCTTTCC 
113 

114 TAATAAAATG AGGAAATTGC ATCGCATTGT CTGAGTAGGT GTCATTCTAT TCTGGGGGGT 
115 

116 GGGGT6GGGC AGGACAGCAA GGGGGAGGAT TGG6AAGACA ATAGCAGGCA TGCTGGGGAT 
117 

118 GCGGTGGGCT CTATGGAACC AGCTGGGGCT CGACAGCTAT GCCAAGTACG CCCCCTATTG 
119 

120 ACGTCAATGA CGGTAAATGG CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT 
121 

122 TTCCTACTTG GCAGTACATC TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT 
123 

124 GGCAGTACAT CAATGGGCGT GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC 
125 

126 CCATTGACGT CAATGGGAGT TTGTTTTGGC ACCAAAATCA ACG6GACTTT CCAAAATGTC 
127 

128 GTAACAACTC CGCCCCATTG ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA 
129 

130 TAAGCAGAGC TGGGTACGTC CTCACATTCA GTGATCAGCA CTGAACACAG ACCCGTCGAC 
131 

132 ATGGGTTGGA GCCTCATCTT GCTCTTCCTT GTCGCTGTTG CTACGCGTGT CGCTAGCACC 
133 

134 AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 
135 

136 GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 
137 

138 GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 
139 

140 TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 
141 

142 AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGCAGAGCC CAAATCTTGT 
143 

144 GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCAGTC 
145 

146 TTCCTCTTCC CCCCAAAACC CAAGGACACC CTCATGATCT CCCGGACCCC TGAGGTCACA 
147 

148 TGCGTGGTGG TGGACGTGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGT6GAC 
149 

150 G6C6TG6AG6 T6CATAATGC CAA6ACAAAG CC6CG6GAGG AGCA6TACAA CA6CAC6TAC 
151 

152 CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGAGTACAAG 
153 

154 TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 
155 

156 GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCC6GGATGA GCTGACCAAG 
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158 AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGT6GAG 2940 
159 

160 TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC 3000 
161 

162 GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG 3060 
163 

164 AACGTCTTCT CATGCTCCGT GATGCATGAG GCTCTGCACA ACCACTACAC GCAGAAGAGC 3120 
165 

166 CTCTCCCTGT CTCCGGGTAA ATGAGGATCC GTTAACGGTT ACCAACTACC TAGACTGGAT 3180 
167 

168 TCGTGACAAC ATGCGGCCGT GATATCTACG TATGATCAGC CTCGACTGTG CCTTCTAGTT 3240 
169 

170 GCCAGCCATC TGTTGTTTGC CCCTCCCCCG TGCCTTCCTT GACCCTGGAA GGTGCCACTC 3300 
171 

172 CCACTGTCCT TTCCTAATAA AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGTCATT 3360 
173 

174 CTATTCTGGG GGGTGGGGTG G6GCAGGACA GCAAGGGGGA GGATTGGGAA GACAATAGCA 3420 
175 

17 6 GGCATGCTGG GGATGCGGTG GGCTCTATGG AACCAGCTGG GGCTCGACAG CGCTGGATCT 3480 
177 

178 CCCGATCCCC AGCTTTGCTT CTCAATTTCT TATTTGCATA ATGAGAAAAA AAGGAAAATT 3540 
179 

180 AATTTTAACA CCAATTCAGT AGTTGATTGA GCAAATGCGT TGCCAAAAAG GATGCTTTAG 3600 
181 

182 AGACAGTGTT CTCTGCACAG ATAAGGACAA ACATTATTCA GAGGGAGTAC CCAGAGCTGA 3660 
183 

184 GACTCCTAAG CCAGTGAGTG GCACAGCATT CTAGGGAGAA ATATGCTTGT CATCACCGAA 3720 
185 

186 GCCTGATTCC GTAGAGCCAC ACCTTGGTAA GGGCCAATCT GCTCACACAG GATAGAGA6G 3780 
187 

188 GCAGGAGCCA GGGCAGAGCA TATAAGGTGA GGTAGGATCA GTTGCTCCTC ACATTTGCTT 3840 
189 

190 CTGACATAGT TGTGTTGGGA GCTTGGATAG CTTGGACAGC TCAGGGCTGC GATTTCGCGC 3900 
191 

192 CAAACTTGAC GGCAATCCTA GCGTGAAGGC TGGTAGGATT TTATCCCCGC TGCCATCATG 3960 
193 

194 GTTCGACCAT TGAACTGCAT CGTCGCCGTG TCCCAAAATA TGGGGATTGG CAAGAACGGA 4020 
195 

196 GACCTACCCT GGCCTCCGCT CAGGAACGAG TTCAAGTACT TCCAAAGAAT GACCACAACC 4080 
197 

198 TCTTCAGTGG AAGGTAAACA GAATCTGGTG ATTATGGGTA GGAAAACCTG GTTCTCCATT 4140 
199 

200 CCTGAGAAGA ATCGACCTTT AAAGGACA6A ATTAATATAG TTCTCAGTAG AGAACTCAAA 4200 
201 

202 GAACCACCAC GAGGAGCTCA TTTTCTTGCC AAAAGTTTGG ATGATGCCTT AAGACTTATT 4260 
203 

204 GAACAACCGG AATTGGCAAG TAAAGTAGAC ATGGTTTGGA TAGTCGGAGG CAGTTCTGTT 4320 
205 

206 TACCAGGAAG CCATGAATCA ACCAGGCCAC CTTAGACTCT TTGTGACAAG GATCAT6CAG 4380 
207 

208 GAATTTGAAA GTGACACGTT TTTCCCAGAA ATTGATTTGG GGAAATATAA ACTTCTCCCA 4440 
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CAATAGCAGG CATGCTGGGG ATGCGGTGGG CTCTATGGAA CCAGCTGGGG CTCGAGCTAC 
TA6CTTTGCT TCTCAATTTC TTATTTGCAT AATGAGAAAA AAAGGAAAAT TAATTTTAAC 



209 

210 GAATACCCAG GCGTCCTCTC TGAGGTCCAG GAGGAAAAAG GCATCAAGTA TAAGTTTGAA 
211 

212 GTCTACGAGA AGAAAGACTA ACAGGAAGAT GCTTTCAAGT TCTCTGCTCC CCTCCTAAAG 
213 

214 CTATGCATTT TTATAAGACC ATGGGACTTT TGCTGGCTTT AGATCAGCCT CGACTGTGCC 
215 

216 TTCTAGTTGC CAGCCATCTG TTGTTTGCCC CTCCCCCGTG CCTTCCTTGA CCCTGGAAGG 
217 

218 TGCCACTCCC ACTGTCCTTT CCTAATAAAA TGAGGAAATT GCATCGCATT GTCTGAGTAG 
219 

220 GTGTCATTCT ATTCTGGGGG GTGGGGTGGG GCAGGACAGC AAGGGGGAGG ATTGGGAAGA 
221 
222 
223 
224 
225 

226 ACCAATTCAG TAGTTGATTG AGCAAATGCG TTGCCAAAAA GGATGCTTTA GAGACAGTGT 
227 

228 TCTCTGCACA GATAAGGACA AACATTATTC AGAGGGAGTA CCCAGAGCTG AGACTCCTAA 
229 
230 
231 
232 
233 

234 AGGGCAGAGC ATATAAGGTG AGGTAGGATC AGTTGCTCCT CACATTTGCT TCTGACATAG 
235 

236 TTGTGTTGGG AGCTTGGATC GATCCTCTAT GGTTGAACAA GATGGATTGC ACGCAGGTTC 
237 

238 TCCGGCCGCT TGGGTGGA6A GGCTATTCGG CTATGACTGG GCACAACAGA CAATCGGCTG 
239 

240 CTCTGATGCC GCCGTGTTCC GGCTGTCAGC GCAGGGGCGC CCGGTTCTTT TTGTCAAGAC 
241 

242 CGACCTGTCC GGTGCCCTGA ATGAACTGCA GGACGAGGCA GCGCGGCTAT CGTGGCTGGC 
243 
244 
245 



GCCAGTGAGT GGCACAGCAT TCTAG6GAGA AATATGCTTG TCATCACCGA AGCCTGATTC 
CGTAGAGCCA CACCTTGGTA AGGGCCAATC TGCTCACACA GGATAGAGAG GGCAGGAGCC 



CAC6ACGGGC GTTCCTTGCG CAGCT6TGCT CGACGTTGTC ACTGAAGCGG GAAGGGACTG 



246 GCTGCTATTG GGCGAAGTGC CGGGGCAGGA TCTCCTGTCA TCTCACCTTG CTCCTGCCGA 
247 

248 GAAAGTATCC ATCATGGCTG ATGCAATGCG GCGGCTGCAT ACGCTTGATC CGGCTACCTG 
249 

250 CCCATTCGAC CACCAAGCGA AACATCGCAT CGAGCGAGCA CGTACTCGGA TGGAAGCCGG 
251 

252 TCTTGTCGAT CAGGATGATC TGGACGAAGA GCATCAGGGG CTCGCGCCAG CCGAACTGTT 
253 
254 
255 
256 
257 

258 GCTGGGTGTG GCGGACCGCT ATCAGGACAT AGCGTTGGCT ACCCGTGATA TTGCTGAAGA 
259 

260 GCTTGGCGGC GAATGGGCTG ACCGCTTCCT CGTGCTTTAC GGTATCGCCG CTCCCGATTC 



CGCCAGGCTC AAGGCGCGCA TGCCCGACGG CGAGGATCTC GTCGTGACCC ATGGCGATGC 
CTGCTTGCCG AATATCATGG TGGAAAATGG CCGCTTTTCT GGATTCATCG ACTGTGGCCG 
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261 
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284 
285 
286 
287 
288 
289 
290 
291 
292 
293 
294 
295 
296 
297 
298 
299 
300 
301 
302 
303 
304 
305 
306 
307 
308 
309 
310 
311 
312 



GCAGCGCATC 
6AAATGACC6 
TTCTATGAAA 
CGCG6GGATC 
G6TTACAAAT 
TCTAGTTGTG 
ATCCCGTCGA 
CGCTCACAAT 
AATGAGTGAG 
ACCTGTCGTG 
TTGGGCGCTC 
GA6CGGTATC 
CAGGAAAGAA 
TGCTGGCGTT 
GTCAGAGGTG 
CCCTCGTGCG 
CTTCGGGAA6 
TCGTTCGCTC 
TATCCGGTAA 
CAGCCACTGG 
AGTGGTGGCC 
AGCCAGTTAC 
GTAGCGGTGG 
AAGATCCTTT 
GGATTTTGGT 
GAAGTTTTAA 



GCCTTCTATC 
ACCAAGCGAC 
GGTTGGGCTT 
TCATGCTGGA 
AAAGCAATAG 
GTTTGTCCAA 
GAGCTTG6CG 
TCCACACAAC 
CTAACTCACA 
CCAGCTGCAT 
TTCCGCTTCC 
AGCTCACTCA 
CATGTGAGCA 
TTTCCATAGG 
GCGAAACCCG 
CTCTCCTGTT 
CGTGGCGCTT 
CAAGCTGGGC 
CTATCGTCTT 
TAACAGGATT 
TAACTACGGC 
CTTCGGAAAA 
TTTTTTTGTT 
GATCTTTTCT 
CATGAGATTA 
ATCAATCTAA 



GCCTTCTTGA 
GCCCAACCTG 
CGGAATCGTT 
GTTCTTCGCC 
CATCACAAAT 
ACTCATCAAT 
TAATCATGGT 
ATACGAGCCG 
TTAATTGCGT 
TAATGAATC6 
TCGCTCACTG 
AAGGCGGTAA 
AAA6GCCAGC 
CTCCGCCCCC 
ACAGGACTAT 
CCGACCCTGC 
TCTCAATGCT 
TGTGTGCACG 
GAGTCCAACC 
AGCA6AGCGA 
TACACTAGAA 
AGA6TTGGTA 
TGCAAGCAGC 
ACG6GGTCTG 
TCAAAAAGGA 
AGTATATATG 



CGAGTTCTTC 
CCATCACGAG 
TTCCGGGACG 
CACCCCAACT 
TTCACAAATA 
CTATCTTATC 
CATAGCTGTT 
GAAGCATAAA 
TGCGCTCACT 
GCCAACGCGC 
ACTCGCTGCG 
TACGGTTATC 
AAAAGGCCAG 
CTGACGAGCA 
AAAGATACCA 
CGCTTACCGG 
CACGCTGTAG 
AACCCCCCGT 
CGGTAAGACA 
GGTATGTAGG 
GGACAGTATT 
GCTCTTGATC 
A6ATTACGCG 
ACGCTCAGTG 
TCTTCACCTA 
AGTAAACTTG 



TGA6CGGGAC 
ATTTCGATTC 
CCGGCTGGAT 
TGTTTATTGC 
AAGCATTTTT 
ATGTCTGGAT 
TCCTGTGTGA 
GTGTAAAGCC 
GCCCGCTTTC 
GGG6AGAGGC 
CTCGGTCGTT 
CACAGAATCA 
GAACC6TAAA 
TCACAAAAAT 
GGCGTTTCCC 
ATACCTGTCC 
GTATCTCAGT 
TCA6CCCGAC 
CGACTTATCG 
CGGTGCTACA 
TGGTATCTGC 
CGGCAAACAA 
CAGAAAAAAA 
GAACGAAAAC 
GATCCTTTTA 
GTCTGACAGT 



TCTGGGGTTC 
CACCGCCGCC 
GATCCTCCAG 
AGCTTATAAT 
TTCACTGCAT 
CGCGGCCGCG 
AATTGTTATC 
TGGGGTGCCT 
CAGTCGGGAA 
GGTTTGCGTA 
CGGCTGCGGC 
GGGGATAACG 
AAGGCCGCGT 
CGACGCTCAA 
CCTGGAAGCT 
GCCTTTCTCC 
TCGGTGTAGG 
CGCTGCGCCT 
CCACTGGCAG 
GAGTTCTTGA 
GCTCTGCTGA 
ACCACCGCTG 
GGATCTCAAG 
TCACGTTAAG 
AATTAAAAAT 
TACCAATGCT 
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313 

314 TAATCAGTGA GGCACCTATC TCAGCGATCT GTCTATTTCG TTCATCCATA GTTGCCTGAC 7620 
315 

316 TCCCCGTCGT GTAGATAACT ACGATACGGG AGGGCTTACC ATCTGGCCCC AGTGCTGCAA 7680 
317 

318 TGATACCGCG AGACCCACGC TCACCGGCTC CAGATTTATC AGCAATAAAC CAGCCAGCCG 7740 
319 

32 0 GAAGGGCCGA GCGCAGAAGT GGTCCTGCAA CTTTATCCGC CTCCATCCAG TCTATTAATT 7800 
321 

322 GTTGCC6GGA AGCTAGAGTA AGTAGTTCGC CAGTTAATAG TTTGCGCAAC GTTGTTGCCA 7860 
323 

324 TTGCTACAGG CATCGTGGTG TCACGCTCGT CGTTTGGTAT GGCTTCATTC AGCTCCGGTT 7920 
325 

326 CCCAACGATC AAGGCGAGTT ACATGATCCC CCATGTTGTG CAAAAAA6CG GTTAGCTCCT 7980 
327 

328 TCGGTCCTCC GATCGTTGTC AGAAGTAAGT TGGCCGCAGT GTTATCACTC ATGGTTATGG 8040 
329 

330 CAGCACTGCA TAATTCTCTT ACTGTCATGC CATCCGTAAG ATGCTTTTCT GTGACTGGTG 8100 
331 

332 AGTACTCAAC CAAGTCATTC TGAGAATAGT GTATGCGGCG ACCGAGTTGC TCTTGCCCGG 8160 
333 

334 CGTCAATACG GGATAATACC GCGCCACATA GCAGAACTTT AAAAGTGCTC ATCATTGGAA 8220 
335 

336 AACGTTCTTC GGGGCGAAAA CTCTCAAGGA TCTTACCGCT GTTGAGATCC AGTTCGATGT 82 80 

337 

338 AACCCACTCG TGCACCCAAC TGATCTTCAG CATCTTTTAC TTTCACCAGC GTTTCTGGGT 8340 
339 

340 GA6CAAAAAC A66AAG6CAA AATGCCGCAA AAAAG66AAT AAG6GC6ACA CG6AAATGTT 8400 
341 

342 GAATACTCAT ACTCTTCCTT TTTCAATATT ATTGAAGCAT TTATCA6GGT TATTGTCTCA 8460 
343 

344 TGAGCGGATA CATATTTGAA TGTATTTAGA AAAATAAACA AATAGGGGTT CCGCGCACAT 8520 
345 

346 TTCCCCGAAA AGTGCCACCT 8540 

347 

348 

349 (3) INFORMATION FOR SEQ ID NO.: 2: 

350 

351 (i) SEQUENCE CHARACTERISTICS: 

352 

353 (A) LENGTH: 9209 bases 

354 (B) TYPE: nucleic acid 

355 (C) STRANDEDNESS: single 

356 (D) TOPOLOGY: circular 
357 

358 (ii) MOLECULE TYPE: DNA (genomic) 
359 

360 (iii) HYPOTHETICAL: no 
361 

362 (iv) ANTI-SENSE: no 
363 

364 (ix) SEQUENCE DESCRIPTION: SEQ ID NO.: 2: 
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365 
366 

367 GACGTCGCGG CCGCTCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG 60 
368 

369 AGGCCGA6GC GGCCTCG6CC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA TGCATGGGGC 120 
370 

371 G6AGAATG6G CGGAACT666 CGGAGTTAGG GGCGGGATGG GCG6AGTTAG GGGCGGGACT 180 
372 

373 ATGGTTGCTG ACTAATT6AG AT6CATGCTT TGCATACTTC TGCCTGCTGG GGA6CCTGGG 240 
374 

375 GACTTTCCAC ACCTGGTTGC TGACTAATTG AGATGCATGC TTT6CATACT TCTGCCTGCT 300 
376 

377 GGGGAGCCTG GGGACTTTCC ACACCCTAAC TGACACACAT TCCACAGAAT TAATTCCCCT 360 
378 

379 AGTTATTAAT AGTAATCAAT TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC 420 
380 

381 GTTACATAAC TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG 480 
382 

383 ACGTCAATAA TGACGTATGT TCCCATAGTA AC6CCAATAG GGACTTTCCA TTGACGTCAA 540 
384 

385 TGGGTG6ACT ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 600 
386 

387 AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC 660 
388 

389 ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC 720 
390 

391 ATGGTGATGC GGTTTTG6CA GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA 780 
392 

393 TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTT6GCACCA AAATCAACGG 840 
394 

395 GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA 900 
396 

397 CGGTGGGAGG TCTATATAAG CAGAGCTGGG TACGTGAACC GTCAGATCGC CTGGAGACGC 960 
398 

399 CATCACAGAT CTCTCACTAT GGATTTTCAG GTGCAGATTA TCAGCTTCCT GCTAATCAGT 1020 
400 

401 GCTTCAGTCA TAATGTCCAG AGGACAAATT GTTCTCTCCC AGTCTCCAGC AATCCTGTCT 1080 
402 

403 GCATCTCCAG GGGAGAAGGT CACAATGACT TGCAGGGCCA GCTCAAGTGT AAGTTACATC 1140 
404 

405 CACTGGTTCC AGCAGAA6CC AGGATCCTCC CCCAAACCCT GGATTTATGC CACATCCAAC 1200 
406 

407 CTGGCTTCTG GAGTCCCTGT TCGCTTCAGT GGCAGTGGGT CTGGGACTTC TTACTCTCTC 1260 
408 

409 ACAATCAGCA GAGTGGAGGC TGAAGATGCT GCCACTTATT ACTGCCAGCA GTGGACTAGT 1320 
410 

411 AACCCACCCA CGTTCGGAGG GGGGACCAAG CTGGAAATCA AACGTACGGT GGCTGCACCA 1380 
412 

413 TCTGTCTTCA TCTTCCCGCC ATCTGATGAG CAGTTGAAAT CTG6AACTGC CTCTGTTGTG 1440 
414 

415 TGCCTGCTGA ATAACTTCTA TCCCAGAGAG GCCAAAGTAC AGTGGAAGGT GGATAACGCC 1500 
416 
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417 CTCCAATCGG GTAACTCCCA GGAGA6TGTC ACA6A6CAGG ACA6CAAGGA CAGCACCTAC 
418 

419 AGCCTCAGCA GCACCCTGAC GCTGAGCAAA GCAGACTACG AGAAACACAA AGTCTACGCC 
420 

421 TGCGAAGTCA CCCATCAGGG CCTGAGCTCG CCCGTCACAA AGA6CTTCAA CAGGGGAGAG 
422 

423 TGTTGAATTC AGATCCGTTA ACGGTTACCA ACTACCTAGA CTGGATTCGT GACAACATGC 
424 

425 GGCCGTGATA TCTACGTATG ATCAGCCTCG ACTGTGCCTT CTA6TTGCCA GCCATCTGTT 
426 

427 GTTTGCCCCT CCCCCGTGCC TTCCTTGACC CTGGAAGGTG CCACTCCCAC TGTCCTTTCC 
428 

429 TAATAAAATG AGGAAATTGC ATCGCATTGT CTGAGTAGGT GTCATTCTAT TCTGGGGGGT 
430 

431 G6GGTGGGGC AGGACAGCAA GGGGGAGGAT TGGGAA6ACA ATA6CAGGCA TGCTGGGGAT 
432 

433 GCGGTGGGCT CTATGGAACC AGCTGGGGCT CGACAGCTAT GCCAAGTACG CCCCCTATTG 
434 

435 ACGTCAATGA CGGTAAATGG CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT 
436 

437 TTCCTACTTG GCAGTACATC TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT 
438 

439 GGCAGTACAT CAATGGGCGT GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC 
440 

441 CCATTGACGT CAATGGGAGT TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC 
442 

443 GTAACAACTC CGCCCCATTG ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA 
444 

445 TAAGCAGAGC TGGGTACGTC CTCACATTCA GTGATCAGCA CTGAACACAG ACCCGTCGAC 
446 

447 ATGGGTTGGA GCCTCATCTT GCTCTTCCTT GTCGCTGTTG CTACGCGTGT CCTGTCCCAG 
448 

449 GTACAACTGC AGCAGCCTGG GGCTGAGCTG GTGAAGCCTG GGGCCTCAGT GAAGATGTCC 
450 

451 TGCAAGGCTT CTGGCTACAC ATTTACCAGT TACAATATGC ACTGGGTAAA ACAGACACCT 
452 

453 GGTCGGGGCC TGGAATGGAT TGGAGCTATT TATCCCGGAA ATGGTGATAC TTCCTACAAT 
454 

455 CAGAAGTTCA AAGGCAAGGC CACATTGACT GCAGACAAAT CCTCCAGCAC AGCCTACATG 
456 

457 CAGCTCAGCA GCCTGACATC TGAGGACTCT GCGGTCTATT ACTGTGCAAG ATCGACTTAC 
458 

459 TACGGCGGTG ACTGGTACTT CAATGTCTGG GGCGCAGGGA CCACGGTCAC CGTCTCTGCA 
460 

461 GCTAGCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG 
462 

463 GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG 
464 

465 TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA 



466 
467 
468 



GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC 
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469 TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAA6G TGGACAA6AA AGCAGAGCCC 3120 
470 

471 AAATCTTGTG ACAAAACTCA CACATGCCCA CCGTGCCCAG CACCTGAACT CCTGGGGGGA 3180 
472 

473 CCGTCAGTCT TCCTCTTCCC CCCAAAACCC AAGGACACCC TCATGATCTC CCGGACCCCT 3240 
474 

475 GAGGTCACAT GC6TGGTGGT GGACGTGAGC CACGAAGACC CTGAGGTCAA GTTCAACTGG 3300 
476 

477 TACGTGGACG GCGTGGAGGT GCATAATGCC AAGACAAAGC CGCGGGAGGA GCAGTACAAC 3360 
478 

479 AGCACGTACC GTGTGGTCAG CGTCCTCACC GTCCTGCACC AGGACTGGCT GAATGGCAAG 3420 
480 

481 GAGTACAAGT GCAAGGTCTC CAACAAAGCC CTCCCAGCCC CCATCGAGAA AACCATCTCC 3480 
482 

483 AAAGCCAAAG GGCAGCCCCG AGAACCACAG GTGTACACCC TGCCCCCATC CCGGGATGAG 3540 
484 

485 CTGACCAAGA ACCAGGTCAG CCTGACCTGC CTGGTCAAAG GCTTCTATCC CAGCGACATC 3600 
486 

487 GCCGTGGAGT GGGA6A6CAA TGGGCAGCCG GAGAACAACT ACAAGACCAC GCCTCCCGTG 3660 
488 

489 CTGGACTCCG ACGGCTCCTT CTTCCTCTAC AGCAAGCTCA CCGTGGACAA GAGCAGGTGG 3720 
490 

491 CAGCAGGGGA ACGTCTTCTC ATGCTCCGTG ATGCATGAGG CTCTGCACAA CCACTACACG 3780 
492 

493 CAGAAGAGCC TCTCCCTGTC TCCGGGTAAA TGAGGATCCG TTAACGGTTA CCAACTACCT 3 840 

494 

495 AGACTGGATT CGTGACAACA TGCGGCCGTG ATATCTACGT ATGATCAGCC TCGACTGTGC 3900 
496 

497 CTTCTAGTTG CCAGCCATCT GTTGTTTGCC CCTCCCCCGT GCCTTCCTTG ACCCTGGAAG 3960 
498 

499 GTGCCACTCC CACTGTCCTT TCCTAATAAA ATGAGGAAAT TGCATCGCAT TGTCTGAGTA 4020 
500 

501 GGTGTCATTC TATTCTGGGG G6TGGGGTGG GGCAGGACAG CAAGGGGGAG GATTGGGAAG 4080 
502 

503 ACAATAGCAG GCATGCTGGG GATGCGGTGG GCTCTATGGA ACCAGCTGGG GCTCGACAGC 4140 
504 

505 GCTGGATCTC CCGATCCCCA GCTTTGCTTC TCAATTTCTT ATTTGCATAA TGAGAAAAAA 4200 
506 

507 AGGAAAATTA ATTTTAACAC CAATTCAGTA GTTGATTGAG CAAATGCGTT GCCAAAAAGG 4260 
508 

509 ATGCTTTAGA GACAGTGTTC TCTGCACAGA TAAGGACAAA CATTATTCAG AGGGAGTACC 4320 
510 

511 CAGAGCTGAG ACTCCTAAGC CAGTGAGTGG CACAGCATTC TAGGGAGAAA TATGCTTGTC 4380 
512 

513 ATCACCGAAG CCTGATTCCG TAGAGCCACA CCTTGGTAAG GGCCAATCTG CTCACACAGG 4440 
514 

515 ATAGAGAGGG CAGGAGCCAG GGCAGAGCAT ATAAGGTGAG GTAGGATCAG TTGCTCCTCA 4500 
516 

517 CATTTGCTTC TGACATAGTT GTGTTGGGAG CTTGGATAGC TTGGACAGCT CAGGGCTGCG 4560 
518 

519 ATTTCGCGCC AAACTTGACG GCAATCCTAG CGTGAAGGCT GGTAGGATTT TATCCCCGCT 4620 
520 
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521 GCCATCATGG TTCGACCATT GAACTGCATC GTCGCCGTGT CCCAAAATAT GGGGATTGGC 
522 

523 AAGAACGGAG ACCTACCCTG GCCTCCGCTC AGGAACGAGT TCAAGTACTT CCAAAGAATG 
524 

525 ACCACAACCT CTTCA6TGGA AGGTAAACAG AATCTGGTGA TTATGGGTAG GAAAACCTGG 
526 

527 TTCTCCATTC CTGAGAAGAA TCGACCTTTA AAGGACAGAA TTAATATAGT TCTCAGTAGA 
528 

529 GAACTCAAAG AACCACCACG AGGAGCTCAT TTTCTTGCCA AAAGTTTGGA TGATGCCTTA 
530 

531 AGACTTATTG AACAACCGGA ATTGGCAAGT AAAGTAGACA TGGTTTGGAT AGTCG6AGGC 
532 

533 AGTTCTGTTT ACCAGGAAGC CATGAATCAA CCAGGCCACC TTAGACTCTT TGTGACAAGG 
534 

535 ATCATGCAGG AATTTGAAAG TGACACGTTT TTCCCAGAAA TTGATTTGGG GAAATATAAA 
536 

537 CTTCTCCCAG AATACCCAGG CGTCCTCTCT GAGGTCCAGG AGGAAAAAGG CATCAAGTAT 
538 

539 AAGTTTGAAG TCTACGAGAA GAAAGACTAA CAGGAAGATG CTTTCAAGTT CTCT6CTCCC 
540 

541 CTCCTAAAGC TATGCATTTT TATAAGACCA TGGGACTTTT GCTGGCTTTA GATCAGCCTC 
542 

543 GACTGTGCCT TCTAGTTGCC AGCCATCTGT TGTTTGCCCC TCCCCCGTGC CTTCCTTGAC 
544 

545 CCTGGAAGGT GCCACTCCCA CTGTCCTTTC CTAATAAAAT 6AGGAAATTG CATCGCATTG 
546 

547 TCTGAGTAGG TGTCATTCTA TTCTGGGGGG TGGGGTGGGG CAGGACAGCA AGGGGGAGGA 
548 

549 TTGGGAAGAC AATAGCAGGC ATGCTGGGGA TGGGGTGGGG TCTATGGAAC CAGCTGGGGC 
550 

551 TCGAGCTACT AGCTTT6CTT CTCAATTTCT TATTTGCATA ATGAGAAAAA AAGGAAAATT 
552 

553 AATTTTAACA CCAATTCAGT AGTTGATTGA GCAAATGCGT TGCCAAAAAG GATGCTTTAG 
554 

555 AGACAGTGTT CTCTGCACAG ATAAG6ACAA ACATTATTCA GAGGGAGTAC CCAGAGCTGA 
556 

557 GACTCCTAAG CCAGTGAGTG GCACAGCATT CTAGGGAGAA ATATGCTTGT CATCACCGAA 
558 

559 GCCTGATTCC GTAGAGCCAC ACCTTGGTAA GGGCCAATCT GCTCACACAG GATAGAGAGG 
560 

561 GCAGGAGCCA GGGCAGAGCA TATAAGGTGA GGTAGGATCA GTTGCTCCTC ACATTTGCTT 
562 

563 CTGACATAGT TGTGTTGGGA GCTTGGATCG ATCCTCTATG GTTGAACAAG ATGGATTGCA 
564 

565 CGCAGGTTCT CCGGCCGCTT GGGTGGAGAG GCTATTCGGC TATGACTGGG CACAACAGAC 
566 

567 AATCGGCTGC TCTGATGCCG CCGTGTTCCG GCTGTCAGCG CAGGG6CGCC CGGTTCTTTT 
568 

569 TGTCAAGACC GACCTGTCCG GTGCCCTGAA TGAACTGCAG GACGAGGCAG CGCGGCTATC 
570 

571 GTGGCTGGCC ACGACGGGCG TTCCTTGCGC AGCTGTGCTC GACGTTGTCA CTGAAGCGGG 
572 
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573 AAGGGACTGG CTGCTATTGG GCGAAGTGCC GGGGCAGGAT CTCCTGTCAT CTCACCTTGC 6240 
574 

575 TCCTGCCGAG AAAGTATCCA TCATGGCTGA TGCAATGCGG CGGCTGCATA CGCTTGATCC 6300 
576 

577 GGCTACCTGC CCATTCGACC ACCAAGCGAA ACATCGCATC GAGCGAGCAC GTACTCGGAT 6360 
578 

57 9 GGAAGCCGGT CTTGTCGATC AGGATGATCT GGACGAAGAG CATCAGGGGC TCGCGCCAGC 6420 
580 

581 CGAACTGTTC GCCAGGCTCA AGGCGCGCAT GCCCGACGGC GAGGATCTCG TCGTGACCCA 6480 
582 

583 TGGCGATGCC TGCTTGCCGA ATATCATGGT GGAAAATGGC CGCTTTTCTG GATTCATCGA 6540 
584 

585 CTGTGGCCGG CTGGGTGTGG CGGACCGCTA TCAGGACATA GCGTTGGCTA CCCGTGATAT 6600 
586 

587 TGCTGAA6AG CTTGGCGGCG AATGGGCTGA CCGCTTCCTC GTGCTTTACG GTATCGCCGC 6660 
588 

589 TCCCGATTCG CAGCGCATCG CCTTCTATCG CCTTCTTGAC GAGTTCTTCT GAGCGGGACT 6720 
590 

591 CTGGGGTTCG AAATGACCGA CCAAGCGACG CCCAACCTGC CATCACGAGA TTTCGATTCC 6780 
592 

593 ACCGCCGCCT TCTATGAAAG GTTGGGCTTC GGAATCGTTT TCCGGGACGC CGGCT6GATG 6840 
594 

595 ATCCTCCAGC GCGGGGATCT CATGCTGGAG TTCTTCGCCC ACCCCAACTT GTTTATTGCA 6900 
596 

597 GCTTATAATG GTTACAAATA AAGCAATAGC ATCACAAATT TCACAAATAA AGCATTTTTT 6960 
598 

599 TCACTGCATT CTAGTTGTGG TTTGTCCAAA CTCATCAATC TATCTTATCA TGTCTGGATC 7020 
600 

601 GCGGCCGCGA TCCCGTCGAG AGCTTGGCGT AATCATGGTC ATAGCTGTTT CCTGTGTGAA 7 080 

602 

603 ATTGTTATCC GCTCACAATT CCACACAACA TACGAGCCGG AAGCATAAAG TGTAAAGCCT 7140 
604 

605 GGGGTGCCTA ATGAGTGAGC TAACTCACAT TAATTGCGTT GCGCTCACTG CCCGCTTTCC 7200 
606 

607 AGTCGGGAAA CCTGTCGTGC CAGCTGCATT AATGAATCGG CCAACGCGCG GGGAGAGGCG 7260 
608 

609 GTTTGCGTAT TGGGCGCTCT TCCGCTTCCT CGCTCACTGA CTCGCTGCGC TCGGTCGTTC 7320 
610 

611 GGCTGCGGCG AGCGGTATCA GCTCACTCAA AGGCGGTAAT ACGGTTATCC ACAGAATCAG 73 80 

612 

613 GGGATAACGC AGGAAAGAAC AT6TGAGCAA AAGGCCAGCA AAAGGCCAGG AACCGTAAAA 7440 
614 

615 AGGCCGCGTT GCTGGCGTTT TTCCATAGGC TCCGCCCCCC TGACGAGCAT CACAAAAATC 7500 
616 

617 GACGCTCAAG TCAGAGGTGG CGAAACCCGA CAGGACTATA AAGATACCAG GCGTTTCCCC 7560 
618 

619 CT6GAAGCTC CCTCGTGCGC TCTCCTGTTC CGACCCTGCC GCTTACCGGA TACCTGTCCG 7 620 

620 

621 CCTTTCTCCC TTCGGGAAGC GTGGCGCTTT CTCAATGCTC ACGCTGTAGG TATCTCAGTT 7680 
622 

623 CGGTGTAGGT CGTTCGCTCC AAGCTGGGCT GTGTGCACGA ACCCCCCGTT CAGCCCGACC 7740 
624 
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625 GCTGCGCCTT ATCCG6TAAC TATCGTCTTG AGTCCAACCC GGTAAGACAC GACTTATCGC 
626 

627 CACTGGCAGC AGCCACTGGT AACAGGATTA GCAGAGCGAG GTATGTAGGC GGTGCTACAG 
628 

629 AGTTCTTGAA GTGGTGGCCT AACTACGGCT ACACTAGAAG GACAGTATTT GGTATCTGCG 
630 

631 CTCTGCTGAA GCCAGTTACC TTCGGAAAAA GAGTTGGTAG CTCTTGATCC GGCAAACAAA 
632 

633 CCACCGCTGG TAGCGGTGGT TTTTTTGTTT GCAAGCAGCA GATTACGCGC AGAAAAAAAG 
634 

635 GATCTCAAGA AGATCCTTTG ATCTTTTCTA CG6GGTCTGA CGCTCAGTGG AACGAAAACT 
636 

637 CACGTTAAGG GATTTTGGTC ATGAGATTAT CAAAAAGGAT CTTCACCTAG ATCCTTTTAA 
638 

639 ATTAAAAATG AAGTTTTAAA TCAATCTAAA GTATATATGA GTAAACTTGG TCTGACAGTT 
640 

641 ACCAATGCTT AATCAGTGAG GC7VCCTATCT CAGCGATCTG TCTATTTCGT TCATCCATAG 
642 

643 TTGCCTGACT CCCCGTCGTG TAGATAACTA CGATACGGGA GGGCTTACCA TCTGGCCCCA 
644 

645 GTGCTGCAAT GATACCGCGA GACCCACGCT CACCGGCTCC AGATTTATCA GCAATAAACC 
646 

647 AGCCAGCCGG AAGGGCCGAG CGCAGAAGTG GTCCTGCAAC TTTATCCGCC TCCATCCAGT 
648 

649 CTATTAATTG TTGCC6GGAA GCTAGAGTAA GTAGTTCGCC AGTTAATAGT TTGCGCAACG 
650 

651 TTGTTGCCAT TGCTACAGGC ATCGTGGTGT CACGCTCGTC GTTT6GTATG GCTTCATTCA 
652 

653 GCTCCGGTTC CCAACGATCA AGGCGAGTTA CATGATCCCC CATGTTGTGC AAAAAAGCGG 
654 

655 TTAGCTCCTT CGGTCCTCCG ATCGTTGTCA GAAGTAAGTT GGCC6CAGTG TTATCACTCA 
656 

657 TGGTTATGGC AGCACTGCAT AATTCTCTTA CTGTCATGCC ATCC6TAAGA TGCTTTTCTG 
658 

659 TGACTGGTGA GTACTCAACC AAGTCATTCT GAGAATAGTG TATGCGGCGA CCGAGTTGCT 
660 

661 CTTGCCCGGC GTCAATACGG GATAATACCG CGCCACATAG CAGAACTTTA AAAGTGCTCA 
662 

663 TCATTGGAAA ACGTTCTTCG G6GCGAAAAC TCTCAAGGAT CTTACCGCTG TTGAGATCCA 
664 

665 6TTCGATGTA ACCCACTCGT GCACCCAACT GATCTTCAGC ATCTTTTACT TTCACCAGCG 
666 

667 TTTCTGGGTG AGCAAAAACA GGAAGGCAAA ATGCCGCAAA AAAGGGAATA AGGGCGACAC 
668 

669 66AAATGTTG AATACTCATA CTCTTCCTTT TTCAATATTA TTGAAGCATT TATCAGGGTT 
670 

671 ATTGTCTCAT GAGCGGATAC ATATTTGAAT GTATTTAGAA AAATAAACAA ATAGGGGTTC 
672 

673 CGCGCACATT TCCCCGAAAA GTGCCACCT 
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677 

678 (i) SEQXTENCE CHARACTERISTICS: 

680 (A) LENGTH: (^^sl bas^) L-S-"^ CjLJ''^— 1 '^■^ '^'^J^^ • 

681 (B) TYPE: nucl&tc-ircid ' ^.^x^ 

682 (C) STRANDEDNESS: single 

683 (D) TOPOLOGY: linear 
684 

685 (ii) MOLECULE TYPE: DNA (genomic) 
686 

687 (iii) HYPOTHETICAL: no 
688 

689 (iv) ANTI-SENSE: no 
690 

691 (ix) SEQUENCE DESCRIPTION: SEQ ID NO.: 3: 

693 ATC ACA GAT CTC TCA CCA TGG ATT TTC AGG TBC AGA TTA TCA GCT 

694 TC /54j 

696 (5) INFORMATION FOR SEQ ID NO.: 4: H ^ 

697 ^ 

698 (i) SEQUENCE CHARACTERISTICS: 
699 

700 (A) LENGTH: 3 0 bases 

701 (B) TYPE: nucleic acid 

702 (C) STRANDEDNESS: single 

703 (D) TOPOLOGY: linear 
704 

705 (ii) MOLECULE TYPE: DNA (genomic) 
706 

707 (iii) HYPOTHETICAL: no 
708 

709 (iv) ANTI-SENSE: yes 
710 

711 (ix) SEQUENCE DESCRIPTION: SEQ ID NO.: 4: 

712 

713 TGC AGC ATC CGT ACG TTT GAT TTC CAG CTT 30 
714 

715 (6) INFORMATION FOR SEQ ID NO.: 5: 

716 

717 (i) SEQUENCE CHARACTERISTICS: 

718 

719 (A) LENGTH: 384 bases 

720 (B) TYPE: nucleic acid 

721 (C) STRANDEDNESS: single 

722 (D) TOPOLOGY: linear 
723 

724 (ii) MOLECULE TYPE: DNA (genomic) 
725 

726 (iii) HYPOTHETICAL: no 
727 

728 (iv) ANTI-SENSE: no 
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729 

730 (ix) SEQUENCE DESCRIPTION: SEQ ID NO. : 5: 

731 

732 

733 ATG GAT TTT CAG GTG CAG ATT ATC AGC TTC CTG CTA ATC AGT GCT TCA GTC 51 
734 

735 ATA ATG TCC AGA GGG CAA ATT GTT CTC TCC CAG TCT CCA GCA ATC CTG TCT 102 
736 

737 GCA TCT CCA GGG GAG AAG GTC ACA ATG ACT TGC AGG GCC AGC TCA AGT GTA 153 
738 

739 AGT TAC ATC CAC TGG TTC CAG CAG AAG CCA GGA TCC TCC CCC AAA CCC TGG 204 
740 

741 ATT TAT GCC ACA TCC AAC CTG GCT TCT GGA GTC CCT GTT CGC TTC AGT GGC 255 
742 

743 AGT GGG TCT GGG ACT TCT TAC TCT CTC ACA ATC AGC AGA GTG GAG GCT GAA 306 
744 

745 GAT GCT GCC ACT TAT TAC TGC CAG CAG TGG ACT AGT AAC CCA CCC ACG TTC 357 
746 

747 GGA GGG GGG ACC AAG CTG GAA ATC AAA 384 

748 

749 

750 (7) INFORMATION FOR SEQ ID NO. : 6: 

751 

752 (1) SEQUENCE CHARACTERISTICS: 

753 

754 (A) LENGTH: 27 bases 

755 (B) TYPE: nucleic acid 

756 (C) STRANDEDNESS : single 

757 (D) TOPOLOGY: linear 
758 

759 (ii) MOLECULE TYPE: DNA (genomic) 
760 

761 (iii) HYPOTHETICAL: no 
762 

763 (iv) ANTI-SENSE: no 
764 

765 (ix) SEQUENCE DESCRIPTION: SEQ ID NO.: 6: 

766 

767 

768 GCG GCT CCC ACG CGT GTC CTG TCC CAG 27 

769 

770 

771 (8) INFORMATION FOR SEQ ID NO, : 7: 

772 

773 (i) SEQUENCE CHARACTERISTICS: 

774 

775 (A) LENGTH: 2 9 bases 

776 (B) TYPE: nucleic acid 

777 (C) STRANDEDNESS: single 

778 (D) TOPOLOGY: linear 
779 

780 (ii) MOLECULE TYPE: DNA (genomic) 
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781 

782 (iii) HYPOTHETICAL: no 
783 

784 (iv) ANTI-SENSE: yes 
785 

786 (ix) SEQUENCE DESCRIPTION: SEQ ID NO.: 7: 

787 

788 GGS TGT T6T GCT AGC TGM RGA GAC RGT GA 29 
789 

790 (9) INFORMATION FOR SEQ ID NO.: 8: 

791 

792 (i) SEQUENCE CHARACTERISTICS: 

793 

794 (A) LENGTH: 420 bases 

795 (B) TYPE: nucleic acid 

796 (C) STRANDEDNESS : single 

797 (D) TOPOLOGY: linear 
798 

799 (ii) MOLECULE TYPE: DNA (genomic) 
800 

801 (iii) HYPOTHETICAL: no 
802 

803 (iv) ANTI-SENSE: no 
804 

805 (ix) SEQUENCE DESCRIPTION: SEQ ID NO.: 8: 

806 

807 

808 ATG GGT TGG AGC CTC ATC TTG CTC TTC CTT GTC GCT GTT GCT ACG CGT GTC 51 
809 

810 CTG TCC CAG GTA CAA CTG CAG CAG CCT GGG GCT GAG CTG GTG AAG CCT GGG 102 
811 

812 GCC TCA GTG AAG ATG TCC TGC AAG GCT TCT GGC TAC ACA TTT ACC AGT TAG 153 
813 

814 AAT ATG CAC TGG GTA AAA CAG ACA CCT GGT CGG GGC CTG GAA TGG ATT GGA 204 
815 

816 GCT ATT TAT CCC GGA AAT GGT GAT ACT TCC TAC AAT CAG AAG TTC AAA GGC 255 
817 

818 AAG GCC ACA TTG ACT GCA GAC AAA TCC TCC AGC ACA GCC TAC ATG CAG CTC 306 
819 

820 AGC AGC CTG ACA TCT GAG GAC TCT GCG GTC TAT TAC TGT GCA AGA TCG ACT 357 
821 

822 TAC TAC GGC GGT GAC TGG TAC TTC AAT GTC TGG GGC GCA GGG ACC ACG GTC 408 
823 

824 ACC GTC TCT GCA 420 

825 

826 

827 

828 

829 

830 

831 

832 
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